Bloscpack: a compressed lightweight serialization format for numerical data
نویسنده
چکیده
This paper introduces the Bloscpack file format and the accompanying Python reference implementation. Bloscpack is a lightweight, compressed binary file-format based on the Blosc codec and is designed for lightweight, fast serialization of numerical data. This article presents the features of the file-format and some some API aspects of the reference implementation, in particular the ability to handle Numpy ndarrays. Furthermore, in order to demonstrate its utility, the format is compared both featureand performance-wise to a few alternative lightweight serialization solutions for Numpy ndarrays. The performance comparisons take the form of some comprehensive benchmarks over a range of different artificial datasets with varying size and complexity, the results of which are presented as the last section of this article.
منابع مشابه
Flat triples approach to RDF graphs in JSON
This paper describes a syntax that can be used to write Resource Description Framework (RDF) graphs for presentation and editing purposes. It propose a alternative mean of serializing RDF triples using JavaScript Object Notation (JSON), a lightweight representation format which emphasizes legibility and brevity. RDF/JSON is a textual syntax for RDF that allows RDF graphs to be completely writte...
متن کاملObject Serialization: A Study of Techniques of Implementing Binary Serialization in C++, Java and .NET
The process of converting a data structure or object state into a storable format is referred to as serialization. The resurrection of the stored data in the same or another computer environment is referred to as deserialization. Binary Serialization is converting the object in binary format and being able to store it in a storage medium. Many programming languages provide interfaces for serial...
متن کاملJSON on Mobile: is there an Efficient Parser?
The two largest causes for battery consumption on mobile devices are related with the display and network operations. Since most application need to share data and communicate with remote servers, communications should be as lightweight and efficient as possible. In network communication, serialization plays a central role as the process of converting an object into a stream of bytes. One of th...
متن کاملRDFa2: Lightweight Semantic Enrichment for Hypertext Content
RDFa is a syntactic format that allows RDF triples to be integrated into hypertext content of HTML/XHTML documents. Although a growing number of methods or tools have been designed attempting at generating or digesting RDFa, comparatively little work has been carried out on finding a generic solution for publishing existing RDF data sets with the RDFa serialisation format. This paper proposes a...
متن کاملObject Serialization Formats and Techniques a Review
Serialization is a process of converting an object into a stream of data so that it can be easily transmittable over the network or can be continued in a persistent storage location. This storage location can be a physical file, database or Network Stream. This paper concludes some the work that is going on in the field of Object Serialization. This paper presents Object Serialization Technique...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1404.6383 شماره
صفحات -
تاریخ انتشار 2014